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Abstract 

Background: Streptococcus pneumoniae is a globally important pathogen. The Gram-positive diplococcus is a 
leading cause of pneumonia, otitis media, bacteremia, and meningitis, and antibiotic resistant strains have become 
increasingly common over recent years.Alanine racemase is a ubiquitous enzyme among bacteria and provides the 
essential cell wall precursor, D-alanine. Since it is absent in humans, this enzyme is an attractive target for the 
development of drugs against 5. pneumoniae and other bacterial pathogens. 

Results: Here we report the crystal structure of alanine racemase from S. pneumoniae (Alr SP ). Crystals diffracted to 
a resolution of 2.0 A and belong to the space group P3 n 21 with the unit cell parameters a = b = 1 19.97 A, c = 
1 18.10 A, a = p = 90° and y = 120°. Structural comparisons show that Alr SP shares both an overall fold and key 
active site residues with other bacterial alanine racemases. The active site cavity is similar to other Gram positive 
alanine racemases, featuring a restricted but conserved entryway. 

Conclusions: We have solved the structure of Alr SP , an essential step towards the development of an accurate 
pharmacophore model of the enzyme, and an important contribution towards our on-going alanine racemase 
structure-based drug design project. We have identified three regions on the enzyme that could be targeted for 
inhibitor design, the active site, the dimer interface, and the active site entryway. 



Background 

Despite great advances in the development of antibio- 
tics, the most common cause of community-acquired 
pneumonia, Streptococcus pneumoniae, is still a globally 
important pathogen, especially in children and the 
elderly [1]. This Gram-positive diplococcus is a leading 
cause not only of pneumonia, but also otitis media, bac- 
teremia, and meningitis [2,3]. In children, S. pneumoniae 
is estimated to cause more than one-third of the 2 mil- 
lion deaths due to acute respiratory infections [4,5]. In 
the elderly, S. pneumoniae is the most common cause of 
fatal community-acquired pneumonia [6,7]. In adults 
from industrialized countries, pneumococcal pneumonia 
accounts for at least 30% of all cases of community- 
acquired pneumonia admitted to hospital, with a fatality 
rate of 11% to 44% [4]. In addition, co-infection of 
influenza patients with S. pneumoniae is known to 
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exacerbate their clinical outcome [4]: for example, 50% 
or more of the flu-associated mortality in the 1918-1919 
Spanish Flu epidemic is believed to have resulted from 
pneumococcal superinfections [8,9], and S. pneumoniae 
co-infection has been specifically correlated with the 
severity of the recent H1N1 pandemic influenza [10]. 
The rate of antibiotic resistance in S. pneumoniae has 
escalated dramatically since penicillin-resistant strains 
were first detected in the 1970s [11-15]. About 40% of 
pneumococcal isolates displayed multidrug-resistant 
phenotypes (resistance to three or more antibiotics) 
across 38 countries in 2004 [16,17]. To meet the chal- 
lenge of increasing pneumococcal drug resistance it will 
be important to isolate new therapeutic compounds 
effective against S. pneumoniae through the identifica- 
tion of new target enzymes and the development of 
effective inhibitors to these targets. 

The bacterial enzyme alanine racemase (Air; E.C. 
5.1.1.1) uses a covalently-bound pyridoxal 5"-phosphate 
(PLP) cofactor to catalyze the racemization of L-alanine 
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and D-alanine, the latter being an essential component 
of the peptidoglycan layer in bacterial cell walls [18]. 
Microbiological evidence has consistently shown this 
enzyme to be essential in bacteria [19]. One apparent 
exception was found for the Mycobacterium smegmatis 
enzyme, which was able tolerate an insertion in its ala- 
nine racemase gene [20]. But this exception was dis- 
proved with the report of an alanine racemase deletion 
mutant in M. smegmatis that did not grow without D- 
alanine supplementation [19]. S. pneumoniae, unlike 
Escherichia coli or Pseudomonas aeruginosa, contains 
only one gene that codes for alanine racemase [21]. The 
lack of alanine racemase function in eukaryotes [22] 
makes this enzyme an attractive target for antimicrobial 
drug development. Structural studies are crucial to 
structure-based drug design [23-25], and solving the 
crystal structure of alanine racemase from S. pneumo- 
niae (Alr SP ) is a crucial step towards designing inhibitors 
of this enzyme. 

To date, crystal structures of alanine racemase 
enzymes from seven different bacteria have been pub- 
lished: Geobacillus stear other mophilus (Alr GS ) [26-31], 
P. aeruginosa (DadX PA ) [32], Streptomyces lavendulae 
(Alr SL ) [33], Mycobacterium tuberculosis (Alr MT ) [34], 
Bacillus anthracis (Alr BA ) [35,36], E. coli (Alr EC ) [37], 
and Enterococcus faecalis (Alr EF ) [38]. Structures of this 
enzyme from a further six microorganisms have been 
deposited in the PDB: Bartonella henselae (PDB ID 
3KW3), Oenococcus oeni (3HUR and 3C08), Pseudomo- 
nas fluorescens (20DO), Actinobacillus succinogenes 
(3C3K), Corynebacterium glutamicum (2DY3), and Sta- 
phylococcus aureus (3002). In all of these structures, 
Air is a homodimeric enzyme formed by a head-to-tail 
association of two monomers. Each monomer is com- 
posed of an N-terminal a/p barrel and an extended (3- 
strand domain at the C-terminus. The active site in 
each monomer is located in the centre of the a/p barrel 
and contains a pyridoxal phosphate (PLP) co-factor 
covalently connected to a lysine residue by an internal 
aldimine bond. The catalytic mechanism is thought to 
involve two bases, the same lysine, and a tyrosine con- 
tributed by the opposite monomer [30,39,40]. The entry- 
way to the active site and the PLP binding site consists 
of residues from loops in the a/p barrel domain of one 
monomer and residues from the C-terminal domain of 
the other monomer, and is roughly conical, with its base 
oriented toward the outside of the enzyme [34]. Struc- 
tures of alanine racemase in complex with substrate 
analogs [27,28,30-32] and site-directed mutagenesis of 
the enzyme [31,40,41] have elucidated the reaction 
mechanism of the enzyme and verified the key roles of 
active site residues. Structures of alanine racemase com- 
plexed with alanine phosphonate and D-cycloserine 
(DCS) show that these inhibitors covalently bind to the 



PLP cofactor, which explains their ability to inhibit 
eukaryotic PLP-containing enzymes in a non-specific 
manner [27,30,37,38]. 

Determining the structure of alanine racemase from a 
range of bacterial species is an important step towards 
its full characterization in anticipation of inhibitor 
design. Here, we report the structure of Alr SP , and com- 
pare it with other published alanine racemase structures. 
This crystal structure will contribute useful information 
towards our structure-based drug design research aimed 
at the identification and development of alanine race- 
mase inhibitors. 

Results and discussion 

Structure determination and refinement 

Crystals of Alr S p suitable for X-ray diffraction were 
grown as described previously [21]. Crystals diffracted to 
a resolution of 2.0 A and belong to the space group 
P3i21 with the unit cell parameters a = b = 119.97 A, 
c = 118.10 A, a = p = 90° and y = 120°. The structure 
of Alr S p was solved by molecular replacement using 
CNS [42] and Alr GS (PDB ID 1SFT) [29] without the 
PLP cofactor as a search model. Refinement was carried 
out initially with CNS, then completed with TLS refine- 
ment [43] in Refmac5 [44]. After structure solution and 
refinement, the final model of Alr SP , validated using 
PROCHECK [45] has 92.7% of residues in the most 
favored regions of the Ramachandran plot, 6.9% of resi- 
dues in the additionally allowed regions and 0.3% of 
residues in the generously allowed regions. The struc- 
ture has root-mean-square (r.m.s.) deviations from ideal- 
ity for bond lengths of 0.015 A and for angles of 1.45°. 
Further data collection and refinement statistics are 
presented in Table 1. 

Overall structure of Alr SP 

Alr S p forms a homodimer in which the two monomers 
form a head-to-tail association, typical of that seen in 
other alanine racemases. Each monomer has an eight- 
stranded a/p barrel domain (residues 1-238) and an 
extended P-strand domain (residues 239-367) (Figure 
1A). The a/p barrel of one monomer is in contact with 
the p-strand domain of the other monomer (Figure IB). 
Although the two monomers have very similar folds, 
they are crystallographically distinct in this crystal form 
(Table 1), and noncrystallographic symmetry was not 
used in refinement. The r.m.s. difference between the 
Ca atoms of the two monomers after superposition is 
0.38 A, and the average B-factors of monomers A and B 
are 38.4 and 46.9 A 2 , respectively. As with other alanine 
racemases, the Alr SP homodimer contains two active 
sites, each composed of residues from the a/p barrel of 
one monomer and residues from the P-strand domain 
of the other. The pyridoxal phosphate (PLP) cofactor is 
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Table 1 Data collection and structure refinement 
statistics 


Data collection 


Unit cell parameters 


a = 1 1 9.97 A, b = 1 1 9.97 A, 




c = 118.10 A 




a = 90°, p = 90°, y = 120° 


Space group 


P3i21 


X (A) 


1.5418 


Mosaicity 


0.48 


Observations 


475265 


Unique reflections 


66748 


R-merge° (%) 


8.3 (68.2) 


Completeness (%) 


99.6 (95.4) 


<l/c> 


21.3 (1.7) 


Refinement statistics 


Resolution (A) 


^» *"> r\ '"i r\r\ r\ r n r\r\\ 

23.03 - 2.00 (2.05 - 2.00) 


Reflections 


63336 (441 2) 


Total atoms 


6161 


R-fartnr b (0/ n \ 
n IdLLUI \/0) 


1 f\ R (11 J} 

1 O.O \DZ.Z) 


Rfree (%) 


20.0 (35.5) 


Average B-factors (A 2 ) 




Wilson B-factor 


33.2 


All atoms 


42.7 


Main chain atoms 


41.8 


Side chain atoms and waters 


43.6 


Waters 


44.5 


R.m.s. deviations 




Bond lengths (A) 


0.015 


Bond angles (deg) 


1.45 


No. of residues 


734, 100% 


No. of protein atoms 


5615 


No. of PLP atoms 


30 


No. of benzoic acid atoms No. of 


9 507 


water molecules 




Residues in the Ramachandran plot 




Most favored regions 


588, 92.7% 


Additionally allowed regions 


44, 6.9% 


Generously allowed regions 


2, 0.3% 


Disallowed regions 


0, 0% 


a R-merge = S|/ obs -/ avg |/E|/ avg | 
b R-factor = X|F obs -F calc |/S|F obs | 

Values in parenthesis are for the highest resolution shell. 

connected to Lys40 through an internal aldimine bond 
and resides inside the a/p barrel domain. 

As an incidental finding, the Alr S p structure contained 
additional electron density within the A monomer, at the 
end of helix 1 in the N- terminal a/p barrel domain. This 
planar density resembled a carboxylated aromatic ring, 
therefore a benzoic acid molecule, which fitted and refined 
well, was modeled into this region, even though the com- 
pound was not added to purification or crystallization 



conditions (topology and parameters obtained from the 
Hetero-compound Information Centre-Uppsala, HIC-UP 
[46]). It is situated some distance away from both the 
active site entryway and the dimer interface. 

Structural and biochemical comparison with closely 
related alanine racemases 

As noted in our previous publication [21], Alr SP displays 
a high level of sequence similarity with other alanine 
racemases. The structure-based sequence alignment in 
Figure 2 demonstrates this similarity with alanine race- 
mases from other Gram-positive bacteria: Alr EF (which 
has 52% sequence identity with Alr SP ), Alr GS (46% iden- 
tity), Alr BA (38% identity), and Alr SL (36% identity). 
Regions absolutely conserved across all of these enzymes 
include the characteristic PLP binding site motif near 
the N-terminus (AVVKANAYGHG), the two catalytic 
amino acid residues of the active center (Lys40, Tyr263'; 
throughout this paper, primed numbers denote residues 
from the second monomer) and the eight residues mak- 
ing up the entryway to the active site (inner layer: 
Tyr263', Tyr352, Tyr282', and Alal69; middle layer: 
Arg307', Ile350, Arg288', and Aspl70). 

Superposition of the Ca atoms of monomer A from 
Alr SP with equivalent alanine racemase domains from 
other Gram-positive bacteria confirms the overall topo- 
logical similarity between these structures (Figure 3A). 
There are minor conformational differences between 
these alanine racemases at the N- and C-termini and 
some loops in the a/p -barrel domain. Alr S p is similar in 
length to Alr SL and Alr EF ; whereas Alr GS and Alr BA have 
15 to 19 extra residues at the C- terminus that form an 
extra P -strand and helix/turn which contact the N-ter- 
mini and the closest two helices of the a/p -barrel of 
each structure, and do not form part of the active site. 
The significance of these extra residues or lack thereof 
is unknown; future mutagenesis or domain-swap experi- 
ments may help to uncover their function. 

Within each alanine racemase, the C- and N-terminal 
domains of each monomer are structurally distinct, and 
the hinge angle varies between the different enzymes 
[32,36], thereby preventing the optimal superposition of 
whole monomers. Overlaying the Ca atoms of Alr SP and 
alanine racemase structures from other Gram-positive 
bacteria results in average r.m.s. differences of 1.16-1.57 
A (Table 2), but when the N-terminal and C-terminal 
domains from Alr SP are superimposed separately, the C- 
terminal domain is shown to be more conserved (aver- 
age r.m.s. differences of 0.49-1.24 A), than the N-term- 
inal domain (r.m.s. of 1.30-1.92 A). Domain boundaries 
and residues used in these superpositions are listed in 
Table 3. The subset of residues found in the active site 
of Alr S p superpose very well with the equivalent residues 
of the other structures (r.m.s. of 0.36-0.67 A). C-termini 
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Figure 1 Structure of alanine racemase from S. pneumoniae. (A) Ribbon diagram of the alanine racemase monomer with (3-sheets colored 
green and a-helices colored gold. (B) Ribbon diagram of the alanine racemase dimer where one monomer is colored blue and the opposite 
monomer red. The N'-pyridoxyl-lysine-5'-monophosphate or LLP residue (PLP cofactor covalently bound to lysine; black or grey spheres) resides 
in the a/|3 barrel domain of the active site. The active site is composed of residues from the a/(3 barrel domain of one monomer and residues 
from the (3-strand domain of the other monomer. 



and active site residues superpose particularly well 
between Alr SP and Alr GS , and also between Alr B A and 
Alr EF . This structural similarity explains why Alr GS was 
such a successful molecular replacement model Varia- 
bility in the N-terminal domain is further illustrated by 
superposition of the N-terminal domains of Alr SP and 
its closest available homolog, Alr EF , which reveals signif- 
icant deviations in Ca positions (>1.8 A) for five 
regions: residues 27-29, residues 53-58, residues 109- 
122, residues 150-156, and residues 192-196 (Figure 3B). 
The sequence in these regions is not highly conserved 
and they lie far from the active site. Superposition of the 
C-terminal domains from these structures shows no 
region with Ca differences greater than 1.7 A. Overall, 
alanine racemase structures seem to tolerate significant 
alterations in the backbone of the a/p -barrel and P- 
domain and still retain almost identical active site resi- 
due locations. 

The kinetic properties for Alr SP [21] are within the 
range of those previously observed for other bacterial 
alanine racemases (Table 4). The K M for L-alanine is 1.9 
mM and V max for the racemization of L- to D-alanine is 
84.8 U/mg, where one unit is defined as the amount of 



enzyme that catalyzes racemization of 1 [imol of sub- 
strate per minute. In the other direction, the K M for D- 
alanine is 2.1 mM and V max for the racemization of L- 
to D-alanine is 87.0 U/mg. However, the V max for the S. 
pneumoniae enzyme is more than one order of magni- 
tude lower than that reported for the G. stear other mo - 
philus and E. faecalis enzymes, even though the active 
site of Alr SP has high sequence and structural similari- 
ties with these alanine racemases. Differences of up to 
three orders of magnitude have been reported in this 
family despite very similar active sites. 

Hinge angle 

The hinge angle of the A monomer of Alr SP , formed by 
the Ca atoms of residues 99, 38 and 270 in the N-term- 
inal a/p barrel domain and the C-terminal P -strand 
domain, is 132.3°. This is well within the range of hinge 
angles found between corresponding residues in the 
other Gram-positive alanine racemase structures (127.6° 
for AlrBA? 129.4° for Alr GS , 131.6° for Alr EF , and 138.2° 
for Alr SL ). The difference in the degree of tilt between 
the C-terminal domains for the five structures can be 
seen in Figure 3A. Hydrogen bonding between the C- 
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Figure 2 Structure-based sequence alignment of the five solved alanine racemase structures from Gram-positive bacteria Structures 
are from S. pneumoniae, G. steorothermophilus [29], E. faecolis [38], B. anthracis [36] and S. lavendulae [33]. The black box encloses the conserved 
PLP binding site, the asterisks (*) mark the PLP-bound Lys residue and the catalytic Tyr residue, the diamond (♦) marks the location of the 
carbamylated Lys residue, and the residues constituting the entryway to the active site are marked with either I (inner layer) or M (middle layer). 
Residues that form intermonomer interfaces are highlighted in light green. The purple shading is proportional to the degree of sequence 
identity across the alignment. 



and N-terminal tails of opposite monomers was pro- 
posed by LeMagueres et al to account for the distinct 
domain orientations of Alr MT and DadX PA [34]. Alanine 
racemase structures with extra residues at the N- and 
C-terminal tails, such as Alr GS and Alr B A> often form 
these hydrogen bonds, which are associated with smaller 
hinge angles (127.6° for Alr BA > 129.4° for Alr GS )[36]. 

Although the hinge angle clearly varies from species to 
species for this enzyme, the active sites superpose very 
well. Further, there is no correlation between hinge 
angle and Vmax (data not shown). On the other hand, 
there is some correlation between alanine racemase 
activity and bacterial doubling time. For example, the 
enzyme from the slow growing M. tuberculosis is very 
slow compared to the same enzyme from the rapid 
growing M. smegmatis species. It has previously been 
noted that only the dimeric form of the enzyme is active 



[47] and that many of the alanine racemase enzymes 
with the strongest monomer-dimer association have 
been found to be the most active [48]. A recent report 
has appeared looking at how enzyme activity in different 
alanine racemases relates to self- association affinity and 
this report confirms this assertion [49]. 

Active site 

The geometry and identities of the active site residues 
of Alr SP (Figure 4A) are very similar to that of other ala- 
nine racemases (Figure 4B). The main components of the 
Alr S p active site include the PLP cofactor covalently 
bound to Lys40 (forming an N'-pyridoxyl-lysine-S'-mono- 
phosphate or LLP residue), the catalytic base residue 
Tyr263' which lies at the beginning of helix 11 in the 
P -strand domain (contributed by the opposite monomer 
to that providing Lys40), and a hydrogen-bonded network 
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B 

Figure 3 Superposition of alanine racemase monomers from 
Gram-positive bacteria. (A) Ca atom traces of alanine racemases 
from G. steorothermophilus (yellow) [29], £ foecolis (green) [38], B. 
anthrocis (blue) [36], S. lovendulae (red) [33], and 5. pneumoniae 
(pink). The superposed N-terminal a/(3 barrel domains are oriented 
on the bottom of the picture and the C-terminal |3-strand domains 
on the top. Spheres represent the three structurally equivalent 
residues used to measure the hinge angle in each structure. The 
double-headed arrow indicates the variation between hinge angles. 
The PLP-bound Lys residue from Alr SP is shown in black. (B) 
Superposed ribbon representations of the N-terminal domains from 
E. foecolis (green) [38] and 5. pneumoniae (pink), with the most 
divergent regions colored orange. 



of residues (Figure 5). Based on the reaction mechanism 
of alanine racemases proposed elsewhere [29-31,50], 
Alr S p operates by a two-base mechanism where the a- 
hydrogen of either D-alanine or L-alanine is abstracted 
and added by a different catalytic residue. Therefore, in 
the D- to L- direction, the reaction occurs with D-alanine 
binding to produce an external aldmine between PLP and 
D-alanine. Lys40 then abstracts the a-hydrogen to pro- 
duce a carbanonic quinonoid intermediate. Next, Tyr263' 
adds a proton to the Ca of the intermediate from the 
opposite side to produce an external aldimine between 
PLP and what is now L-alanine. Subsequent transaldimi- 
nation liberates L-alanine and regenerates the LLP form 
of the enzyme. 



Table 2 Average r.m.s. differences (A) between the Ca 
atoms of Alr SP and alanine racemase structures from 
other Gram-positive bacteria 





PDB 


Whole 


N- 


C- 


Active 




ID 


monomer 


terminus 


terminus 


site 


Alr GS 


1SFT 


1.23 (46%) 


1.30 (41%) 


0.57 (56%) 


0.36 (66%) 


Alr SL 


1VFH 


1.57 (38%) 


1 .92 (34%) 


1.24 (41%) 


0.67 (46%) 


Alr BA 


3HA1 


1.29 (45%) 


1.59 (41%) 


0.49 (53%) 


0.38 (65%) 


Alr EF 


3E5P 


1.16 (53%) 


1 .48 (52%) 


0.54 (56%) 


0.46 (71%) 



Numbers in parenthesis denote sequence identity with AlrSP, (%sequence 
identity = Nidentity/Naligned). 



In the LLP moiety, the C4" atom of the PLP cofactor 
is linked to the NZ of Lys40 by a double bond in the 
trans- configuration, forming an internal aldimine as in 
other alanine racemase structures [29,31-33]. The PLP 
cofactor is further stabilized by hydrogen bonds with 
the side chains of six residues (Tyr44, Argl36, Hisl65, 
Ser203, Arg218 and Tyr352) and main chains of three 
residues (Ser203, Gly220, Asp221; Figure 4A). The 
hydrogen-bonded network also includes residues His 199 
and Tyr263", and was first described in Alr GS [29]. All 
of these residues are strictly conserved across the Gram- 
positive structures, except for Asp221, which is replaced 
by an He in Alr B A and Alr GS , a Val in Alr EF , and a Leu 
in Alr SL [29,33]. 

We observed electron density consistent with a carba- 
mylated lysine at the NZ terminus of Lys 129, as seen in 
most other alanine racemase structures. Lys 129 refined 
well as a carbamylated residue in this structure and is 
hydrogen bonded to the neighboring arginine residue. 
Shaw et al. [29] noted that this lysine residue is highly 
conserved, and later studies suggested that it helps to 
position the nearby arginine residue (Argl36 in Alr S p) 
which interacts with the substrate's carboxylate group 
through hydrogen-bonding [28,32]. Argl36 is further 
positioned in Alr SP by a hydrogen bond to Ser309. 
Sequences of alanine racemases that contain a lysine in 
position 129 almost always have an accompanying serine 
or cysteine residue in the equivalent of position 309 
[36]. Recently, the Alr B A structure was found to contain 
an aspargine residue bound to a chloride ion at the 
equivalent position of Lys 129, which appears to play the 
same role as the carbamylated Lys of positioning 
the active site arginine [36]. An alignment of alanine 
racemase sequences by Counago et al. revealed that the 
presence of an aspargine residue can occur at the 
equivalent position of Lys 129 in Alr SP and is likely to be 
indicative of an internal chloride within the active site 
in the place of a carbamylated lysine. Notably this 
change from Lys to Ser appears to always be accompa- 
nied by a threonine at the equivalent position of Ser309, 
even though the threonine does not directly interact 
with the chloride ion. 

The environments on either side of the pyridine ring 
of PLP are quite different, as reported previously for 
Alr GS [29,33]. The side of the PLP that faces the dimer 
interface is polar in character, with many hydrophilic 
amino acid residues (including carbamylated Lysl29, 
Argl36, Hisl65 and Arg218), several water molecules 
and the hydrogen-bond network. The nonpolar side of 
PLP, in contact with the a/p barrel, contains several 
hydrophobic residues (Val38, Leu83, Leu85 and 
Phel63), no charged residues and no water molecules. 

As observed in several other alanine racemase struc- 
tures [29,32,34,36], we identified extra density in the 
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Table 3 Residues used in r.m.s. calculations 







Alr EF 


Alr SP 


Alr GS 


Alr BA 


Alr SL 


N-terminus 


monomer A 


2-243 


1-239 


2-241 


4-245 


3-246 


C-terminus 


monomer A 


244-371 


240-367 


n/ii too 

242-388 


246-389 


247-378 


Active site 


monomer A 


38-44 


38-44 


37-43 


39-45 


36-42 






62-66 


61-65 


61-65 


63-67 


60-64 






83-87 


82-86 


82-86 


84-88 


81-85 






1 nn-1 C\A 


1 m -1 


I U I I Uj 


1 07 


1 00-1 04 
I UU I u^- 






128-141 


125-138 


125-138 


127-140 


125-138 






164-172 


160-168 


161-169 


163-171 


163-171 






201-208 


197-204 


198-205 


203-210 


203-210 






219-226 


215-222 


216-223 


221-228 


221-228 






353-360 


349-356 


351-358 


356-363 


358-365 




monomer B 


265-268 


261-264 


263-266 


268-271 


268-271 






311-316 


307-312 


309-314 


314-319 


315-320 



active site of Alr SP adjacent to the PLP cofactor (Figure 
4C). The position of this density corresponds to that of 
the acetate modeled in Alr GS . In other structures, this 
location has been reported to contain propionate, ala- 
nine phosphonate, and a putative substrate molecule in 
DadXp A [28-30,38]. Water molecules in the same loca- 
tion are found in the Alr MT and Alr SL structures. After 
unsuccessfully attempting to model a variety of small 
molecules into the extra density, including acetate, we 
left this region of the model empty. 

Active site entryway 

The entryway to the active site in Alr SP comprises the 
a/p barrel domain of one monomer and residues from 
the C-terminal domain of the other monomer, and is 
about 13 A from the active site C4" atom of PLP. The 
entryway has a funnel-like shape, with its widest end 
towards the outside of the enzyme, narrowing as it 
approaches the PLP. The highly conserved residues 
comprising the entryway are distributed in layers begin- 
ning at the PLP site (Figures 6 A and 6B): charged near 
the entrance, and mainly hydrophobic near the active 



site [33,34]. Mutagenesis has shown that these hydro- 
phobic residues have an important role in controlling 
the substrate specificity of alanine racemase [51]. In 
Alr SP , the inner layer is made up of residues Tyr263', 
Tyr352, Tyr282', and Alal69. Residues Arg307', Ile350, 
Arg288', and Aspl70 make up the middle layer. The 
residues composing the middle and the inner layers are 
strictly conserved between Alr SP , Alr EF , Alr BA , Alr GS , 
and Alr S L- An outer layer exists comprised of Thr345, 
Glul71, Val232 and Gly264', but these residues, which 
are able to interact with solvent directly, are not well 
conserved. 

The Alr SP active site entryway includes the conserved 
pair of acidic residues Aspl70 and Glul71. The equiva- 
lent residues in E. coli, Aspl64 and Glul65, have been 
posited to play a role in substrate orientation [37]. 
Although the active sites of alanine racemases in general 
are moderate in size, it is difficult for inhibitors to 
access because of a constriction in the entryway corridor 
[34]. The smallest constriction in the entryway corridor 
of Alr SP is between Tyr263' and Tyr352 of the inner 
layer (Figure 6A), which provide an opening width of 



Table 4 Kinetic parameters for the racemization of L- to D- and D- to L-alanine by alanine racemases from Gram- 
positive bacteria 







D to L direction 






L to D direction 




Enzyme [ref] 


UmM) 


V mflX (U/mg) a 


kc^min" 1 ) 


K m (mM) 


V mflX (U/mg) a 


kc^min" 1 ) 


Alr SP [21] 


2.1 b 


87.0 b 


NR 


1.9 b 


84.8 b 


NR 


Alr GS [76] 


2.7 C 


1400 c 


NR 


4.3 C 


2550 c 


NR 


Alr SL [77] 


0.4 C 


NR 


3800 c 


0.4 C 


NR 


3300 c 


Alr BA [36] 


2.8 b 


101 b 


NR 


NR 


NR 


NR 


Alr EF [78] 


2.2 C 


1210 c 


~2340 c 


7.8 C 


3570 c 


~2340 c 



a One unit is defined as the amount of enzyme that catalyzes racemization of 1 umol of substrate per minute. 
b At 23°C. c At 37°C. NR: not reported. 
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Figure 4 Active site of alanine racemase from 5. pneumoniae. 

(A) Electron density 2F Q -F C map of the active site contoured at 1.5a, 
excluding solvent. Residues from the first monomer are colored 
pink, residues from the second monomer are blue and are denoted 
with primed numbers. The PLP-bound Lys residue (LLP) is grey. (B) 
Superposition of the active site residues from Gram-positive alanine 
racemase structures with Alr SP ; only S. pneumoniae residues are 
labeled. Residues pictured are from G. steorothermophilus (yellow) 
[29], E. foecolis (green) [38], B. anthrocis (blue) [36], S. lovendulae (red) 
[33], and S. pneumoniae (pink). The chloride ion from the B. 
anthracis structure is depicted as a blue sphere. (C) Unmodeled 
electron density (green) found in the active site. 2F Q -F C (light blue) 
and F Q -F C (green and red) maps are contoured at 1.5 and 3.0 a, 
respectively. Residues are colored and labeled as described for 
Figure 4A. 



only about 2.6A for an active site inhibitor to pass 
through (the distance between the closest atoms of 
these two side chains with the van der Waals radius for 
each atom subtracted). As a result, the substrate entry- 
way itself has been proposed as an alternative target for 
inhibitor development [32,34]. Wang et al. [52] have 
proposed this idea previously for another enzyme, his- 
tone deacetylase-like protein. 

Dimer interface 

Dimerization is essential for the catalytic activity of ala- 
nine racemase [47]. Both monomers contribute to the 
overall composition of the active site, the alanine entry- 
way, and the binding pocket. Within the Alr SP dimer 
interface there are 33 hydrogen bonds and 10 salt bridges 
(Table 5). There are no disulfide or covalent bonds across 
the interface. 91 residues from each monomer are 
involved in intermonomer interactions. The buried sur- 
face areas of the A and B monomers are 3035 and 3020 
A 2 , respectively; both values are 19% of the total surface 
area of each monomer. The interface surface area is simi- 
lar to that seen in the closely related Alr EF and Alr GS 
(Table 5). 30% of the interface residues in Alr SP are polar, 
47% are non-polar, and 22% are charged. 

Disruption of the dimer interface could be used to 
inhibit alanine racemase activity, and has been success- 
fully used on drug targets in HIV and HCV [53-55], and 
on caspases [56]. Inspection of the residues that partici- 
pate in the dimer interface of Alr SP on a structure-based 
sequence alignment (Figure 2) makes it apparent that 
that many of these residues are highly conserved, and 
also participate in substrate guidance (such as middle 
and inner entryway residues Tyr282', Tyr352, Arg307', 
Ile350, Arg288', Aspl70) or catalysis (e.g. Lys40, 
Tyr263'). 

Pentagonal water molecules in the active site 

A cluster of hydrogen-bonded water molecules forms an 
ordered pentagonal ring and some adjacent partial rings 
in the active site entryways of both monomers of Alr SP 
(Figure 7). The pentagonal ring waters are located adja- 
cent to the substrate binding site and between residues 
Tyr263" and Tyr282". They are positioned at the inter- 
face of monomer A and B and appear to be involved in 
the dimer interface, making direct or indirect hydrogen 
bonds with interface residues (Asp 170, Tyr263', Tyr282', 
Tyr288', Arg307', Tyr352). The distance between the 
water oxygen atoms that form each side of the pentagon 
is about 2.7 A. The pentagonal ring is hydrogen-bonded 
directly to the protein at five atoms (Tyr282' OH, 
Arg307' NH1, and Arg288' NH2 and NE from the entry- 
way inner and middle layers, and Val308' O) and makes 
hydrogen bonds with other waters both deeper in the 
active site and at the outer region of the entryway. 
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Figure 5 Schematic diagram of polar interactions around PLP in the active site of alanine racemase from S. pneumoniae. For clarity, 
interactions with water molecules have not been included. Primed numbers denote residues from the second monomer. This figure was drawn 
after LeMagueres et al. [32]. 




15.0001 



Figure 6 Molecular surface representations of the entryway to the active site of alanine racemase from S. pneumoniae. (A) The surface 
of three layers of entryway residues: residues comprising the inner layer are pink (here, the constricting Tyr352 and Tyr263' residues can be 
seen), the middle layer residues are orange, and the outer layer residues are blue. The PLP cofactor is colored green. Primed numbers denote 
residues from the second monomer. (B) Surface of the entryway colored by electrostatic potential (same view as in A). 
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Table 5 Intermonomer interactions for alanine racemases from Gram-positive bacteria calculated using PISA [69] and 
PROTORP [70] 





Buried surface area (A 2 ) 


H-bonds 


Salt link 


# residues 




% interface residues 






Monomer A 


Monomer B 






A 


B 


polar 


non-polar 


charged 


Alr SP 


3035 


3020 


33 


10 


91 


91 


30 


47 


22 


Alr GS 


3058 


3109 


43 


17 


89 


88 


27 


43 


30 


Alr SL 


2800 


2799 


26 


6 


87 


87 


39 


43 


18 


Alr BA 


3529 


3529 


45 


10 


93 


95 


28 


44 


27 


Alr EF 


2936 


2936 


38 


12 


86 


86 


32 


41 


28 




Figure 7 Pentagonal ring waters near the substrate binding site in alanine racemase from S. pneumoniae. The electron density 2F 0 -F C 

map is contoured at 0.8c. Residues are shown as sticks, red spheres represent water molecules, and dashed yellow lines represent hydrogen 

bonds. Residues from the first monomer are colored pink, residues from the second monomer are blue and are denoted with primed numbers. 

The PLP-bound Lys residue (LLP) is grey. For simplicity, only some of the residues are shown, 
v J 
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The hydrogen bond network we have identified could 
be facilitating substrate movement or proton transfer 
into the active site. Analysis of conserved water sites in 
Alr GS has been reported previously and the authors pos- 
tulated that these sites could be involved in proton 
transfer or solvent shift into the active site [57]. In the 
high resolution structure of the protein crambin, Teeter 
reported pentagonal rings of water molecules which 
were felt to have a role in stabilizing protein structure 
or in catalysis [58]. Pentagonal ring waters have not yet 
been reported for other alanine racemases, but depend- 
ing on which monomer is observed, the Alr BA and Alr GS 
structures have 3 to 4 of the 5 pentagonal waters pre- 
sent in Alr SP , the Alr SL structure has 2 of the 5 pentago- 
nal waters, and the Alr EF structure does not have any 
pentagonal ring waters, although this could be due to 
limits in structural resolution. 

Alanine racemase as a target for drug design 

In this section we review some of the challenges 
encountered in developing inhibitors for alanine race- 
mases as a family and we explain the contribution of 
the S. pneumoniae structure to this process. Finally we 
offer our assessment of the most useful approaches to 
alanine racemase inhibitor development. 

Challenges involved in designing inhibitors for alanine 
racemase are easy to identify. To begin with, there have 
been few reports to date of alanine racemase inhibitors 
with any true specificity. Incorporating features of the 
active site in drug design has been challenging because 
the structure of the active site is thought to have limited 
accessibility. Further, several inhibitors have been found 
to cross react with human enzymes that contain PLP. 
Even so, our analysis of alanine racemase structures has 
allowed us to identify key features that could be incor- 
porated into the inhibitor development process. Since 
these key features are also present in the S. pneumoniae 
enzyme structure, it confirms that these features are not 
artifacts or incidental findings but conserved features 
that can be targeted in the development of a class of 
inhibitors specific to bacterial alanine racemases. There- 
fore the structure of the S. pneumoniae enzyme is valu- 
able to racemase drug design efforts. 

In addition, one new feature relevant to the traditional 
drug design approach of blocking the active site that we 
report here for Alr SP is the pentagonal water network 
within the active site. Several of these waters are con- 
served in other alanine racemase species. That being the 
case, the conserved waters could be incorporated within 
an in silico pharmacophore as a polar site capable of 
receiving or donating a hydrogen bond depending on 
its protonation state. Unfortunately, to date testing of 
compounds identified from in silico screening has not 
resulted in the identification of strong inhibitors. 



The earliest drug development work on alanine race- 
mase was carried out in the absence of a crystal struc- 
ture and resulted in the development of a cycloserine, a 
small, covalent inhibitor of alanine racemase and other 
PLP-containing enzymes [59] that lacks any specific 
interactions with elements in the active site. More 
recent in silico drug design work carried out using the 
structure of alanine racemase has defined a pharmaco- 
phore situated within the active site near the alanine 
racemase acetate binding site, a site reported consis- 
tently within alanine racemase structures [60]. However, 
analysis of the narrow entryway to the active site PLP 
suggests that access to the proposed interior binding 
pockets of the enzyme is likely to be limited, especially 
for larger compounds [32,34]. To be an effective drug tar- 
get it is important the active site be accessible, therefore 
standard structure-aided inhibitor design approaches are 
limited for alanine racemase. 

Since, as illustrated by the Alr SP structure, entryway to 
the interior binding pocket of alanine racemase is very 
tight, the drug design problem for alanine racemase 
resembles a "ship in a bottle" (SIAB) dilemma. Once 
inside the interior pocket, the compounds proposed to 
bind to the active site would fit well but these com- 
pounds may only make it to the interior with difficulty 
[32,34,36]. This view is of course an oversimplification, 
as the entryway is likely to 'breathe' and adjust, and 
there is a monomer-dimer equilibrium for alanine race- 
mase that would affect the geometry and accessibility of 
internal active site cavities. However, the restricted 
access and size of the alanine racemase active site is one 
reason it has not been targeted by major pharmaceutical 
companies in the recent past (Bussiere, Dirk; personal 
communication). 

If a drug design project involving an enzyme with a 
SIAB active site is to be successful, there are four 
obvious approaches to inhibitor development: high 
throughput screening (HTS), blocking the opening, 
interfering with active site assembly, or developing 
drugs that enter in one shape and adopt a new confor- 
mation after binding, thus trapping them in the active 
site. 

HTS would bypass any of the complexities associated 
with active site access and would provide a set of com- 
pounds that inhibit the enzyme by any and all means, to 
be deconvoluted later. Given that the active site features 
we describe for the S. pneumoniae enzyme are highly 
conserved in the bacterial structures reported to date, 
the alanine racemase inhibitors identified by HTS would 
likely be broad-spectrum in their action. But a broad 
spectrum of activity should not be viewed in a negative 
light, as almost all major classes of antibiotics developed 
to date are broad spectrum. This includes beta-lactams 
like penicillin and cephalosporins, fluoroquinolones, 
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tetracyclines, even macrolides. In fact the only specificity 
among anti-bacterial classes currently in use would be 
that some target preferentially Gram-positives, Gram- 
negatives, mycobacteria or anaerobes. 

Blocking the opening would involve the design of 
compounds that interact with residues in the entryway 
and that extend toward the PLP moiety, but that might 
not reach the interior binding pocket. In our previous 
work on the alanine racemase from P. aeruginosa, 
M. tuberculosis and B. anthracis, we described a highly 
conserved and layered entryway to the active site that 
contains both hydrophobic and polar features. The 
hydrophobic regions are bound by three tyrosines and 
an alanine in the inner layer of entryway, while the 
polar areas include two arginines and one aspartate 
located in the middle layer. These highly conserved fea- 
tures are present in the S. pneumoniae structure and all 
alanine racemase structures reported to date. An entry- 
way of this type has not been described in human 
PLP-containing enzymes. For example, human serine 
racemase, which like alanine racemase reacts with 
D-cycloserine, belongs to the Fold type II family of PLP- 
containing enzymes and would be not be expected to 
cross-react with any specific alanine racemase inhibitors 
[61]. The alanine racemase topology is termed Fold type 
III and is unique among PLP-containing enzymes. It 
seems likely, therefore, that designing inhibitors that 
interact with conserved motifs found in the entryway 
could represent a potential source of specificity in the 
drug design process. 

Interfering with active site assembly would, in the case 
of alanine racemase, require compounds that inhibit 
dimer formation, none of which have been reported for 
alanine racemase to date. However, dimer inhibitors 
have been reported in other systems such as HIV pro- 
tease [53-55]. Finally, a compound that could enter the 
active site of alanine racemase then undergo a confor- 
mational switch rendering the enzyme inactive would 
make an effective inhibitor, but this type of inhibitor has 
not yet been reported for this class of enzyme. 

Conclusions 

Alanine racemase is a promising target for antibacterial 
drugs because it is both essential in bacteria and absent in 
humans. We report the high-resolution crystal structure 
of alanine racemase from S. pneumoniae. Overall, the 
structure shares the conserved active site and topology 
found across all alanine racemases. Known alanine race- 
mase inhibitors such as D-cycloserine, alanine phospho- 
nate, and other substrate analogues are not specific, acting 
on other PLP-containing enzymes such as transaminases, 
also found in humans [59,62]. In order to be clinically rele- 
vant, new inhibitors of alanine racemase with more specifi- 
city need to be developed. This structure is an essential 



starting point for the design of more specific inhibitors of 
alanine racemase in S. pneumoniae. Our investigations 
have identified three potential areas in the Alr SP structure 
that could be targeted in a structure-based inhibitor 
design: the active site, the residues forming the dimer 
interface, and the active site entryway in particular, since 
designing a plug' to fit the funnel shape of this feature is 
intuitively attractive. 

Methods 

Protein expression, purification and crystallization 

The expression, purification and crystallization of Alr SP 
have been described previously [21]. Briefly, the gene 
encoding Alr SP was cloned into pET17 (Novagen) and 
the resulting vector transformed into E. coli BL21 (DE3) 
pLysS cells (Novagen). Overexpression of Alr S p was 
induced in a culture of these cells, which were then 
lysed to extract the protein. The recombinant Alr SP was 
purified using ammonium sulfate precipitation, anion- 
exchange chromatography, hydrophobic interaction 
chromatography, and finally, size-exclusion chromato- 
graphy. Crystals of Alr S p were grown at 4°C in 1.2 M 
Na Citrate, 0.1 M MES, pH 7.2, and 10% glycerol (pro- 
tein concentration 23 mg/ml, drop size 4 ul + 4 
using the sitting drop vapor diffusion method, then 
flash-frozen in liquid N 2 for data collection. No addi- 
tional cryoprotectant was required. 

Data collection and processing 

Diffraction data were collected to 2.0 A resolution at 
100 K using a Rigaku FR-E generator and an HTC 
detector at 45 kV and 45 mA with Cu Ka radiation at 
Rigaku MSC (The Woodlands, TX). The crystals 
belonged to the space group P3i21 with the unit cell 
parameters a = b = 119.97 A, c = 118.10 A, a = p = 90° 
and y = 120°. The data were processed and merged 
using the HKL package version 1.96.6 [63]. Data collec- 
tion and processing statistics are listed in Table 1. 

Structure determination and refinement 

The structure of Alr SP was solved by molecular replace- 
ment using CNS version 1.1 [42]. Alr GS (PDB ID 1SFT) 
[29] without the PLP cofactor was used as a search 
model, and two monomers per asymmetric unit were 
assumed, as suggested by a Matthews coefficient [64] of 

3.0 with a solvent content of 59.0%. Cross-rotation and 
translation searches were completed and the best solu- 
tion was used as an initial model for model building. 
After rigid body refinement in CNS, ARP/wARP version 

6.1 [65] was used to trace the initial protein model and 
build side chains. Further refinement was carried out 
using simulated annealing and conjugation gradient 
minimization. When 97% of residues were built, the co- 
factor PLP and the carbamylated lysine were placed, and 
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positional and B-factor refinements were continued 
resulting in an R and Rf ree of 31.9 and 33.9%, respec- 
tively. Water molecules were added using the water- 
picking script in CNS, and further cycles of positional 
and B iso refinements brought the R and Rf ree to 20.7 and 
25.7%, respectively. Since previous alanine racemase 
structures have shown indications of subdomain move- 
ment, we tried TLS refinement [43]. We used the TLS 
motion determination server [66,67] to produce modified 
PDB files and TLS input files for the structure partitioned 
into either one, five or twenty TLS groups, then further 
refined these models in Refmac5 version 5.5.0109 [44]. 
All models resulted in similar improvements in R and 
Rfree so we used the simplest option, which treated all 
protein atoms found in the asymmetric unit as a single 
rigid body (one TLS group). PLP and Lys40 were 
replaced with an LLP residue (PLP covalently bound to 
lysine), and TLS refinement was completed using 
Refmac5. The final model has an R and R free of 16.8 and 
20.0%, respectively. Refinement statistics are listed in 
Table 1. Structure factors and final atomic coordinates 
for AlrSP have been deposited in the Protein Databank 
(PDB ID: 3S46). B-factor values and correlation coeffi- 
cients were calculated using the programs Baverage and 
Overlapmap from the CCP4 suite [44]. 

Structural and sequence comparisons 

The multiple structure-based sequence alignment and 
structural superpositions of Alr SP with closely related 
structures were performed using the protein structure 
comparison service (SSM) at the European Bioinfor- 
matics Institute (http://www.ebi.ac.uk/msd-srv/ssm) [68]. 
The PDB IDs of the structures used were: 1SFT (Alr GS ), 
1VFH (Alr SL ), 3HA1 (Alr BA )> and 3E5P (Alr EF ). Table 3 
lists the residues from these structures used in the 
superpositions. Intermonomer interactions were ana- 
lysed using the Protein Interfaces, Surfaces and Assem- 
blies service (PISA) at the European Bioinformatics 
Institute (http://www.ebi.ac.uk/msd-srv/prot_int/pistart. 
html) [69], and the Protein-Protein interface analysis 
server (PROTORP) Server (http://www.bioinformatics. 
sussex.ac.uk/protorp/ index.html) [70]. 

Figure preparation 

Representations of molecules were prepared using the 
programs PyMOL [71] and BKChem (http://bkchem.zir- 
ael.org/index.html). The sequence alignment was visua- 
lized using Jalview [72]. The electrostatic potential of 
the Alr SP surface was calculated using the Adaptive 
Poisson-Boltzmann Solver (APBS) [73] through PyMOL. 
Default configurations were used for calculations. PQR 
files for use with APBS were generated using the PDB 
2PQR Server (http://kryptonite.nbcr.net/pdb2pqr/) [74] 



and the Dundee PRODRG2 Server (http://davapcl. 
bioch.dundee.ac.uk/prodrg/) [75] . 
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